Nearest neighbors and correlation dimension for dimensionality estimation. Application to factor analysis of real biological time series data

نویسندگان

  • Jérôme Lapuyade-Lahorgue
  • Ali Mohammad-Djafari
چکیده

Determining the number of components in dimensionality reduction techniques is still one of the open problems of research on data analysis. These methods are often used in knowledge extraction of multivariate great dimensional data, but very often the number of components is assumed to be known. One of the classical methods to estimate this dimensionality is based on the Principal Components Analysis (PCA) eigenvalues [1, 2]. However, this method supposes that the model is linear and the signals are Gaussian. To be able to consider non-linear and non-Gaussian cases, we propose in this paper “measure based methods” as nearest neighbors dimension and correlation dimension. The comparaison between the three methods is evaluated both with simulated data and with real biological data, which are gene expression time series. The main goal of this study is to estimate the minimum number of factors.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Chaotic Analysis and Prediction of River Flows

Analyses and investigations on river flow behavior are major issues in design, operation and studies related to water engineering. Thus, recently the application of chaos theory and new techniques, such as chaos theory, has been considered in hydrology and water resources due to relevant innovations and ability. This paper compares the performance of chaos theory with Anfis model and discusses ...

متن کامل

On the Predictability of Price Fluctuations in Tehran Stock Exchange A Correlation Dimension Estimation Approach

This paper employs a general non-linear analysis tool to analyse the nature of time series associated with the price (returns) of a particular company in Tehran Stock Exchange. It is shown that the behavior of the process associated with the price (returns) time-series of this company is weakly chaotic, and due to the non-random behavior of the process, short term prediction of stock price is p...

متن کامل

On the Predictability of Price Fluctuations in Tehran Stock Exchange A Correlation Dimension Estimation Approach

This paper employs a general non-linear analysis tool to analyse the nature of time series associated with the price (returns) of a particular company in Tehran Stock Exchange. It is shown that the behavior of the process associated with the price (returns) time-series of this company is weakly chaotic, and due to the non-random behavior of the process, short term prediction of stock price is p...

متن کامل

Anisotropic k-Nearest Neighbor Search Using Covariance Quadtree

We present a variant of the hyper-quadtree that divides a multidimensional space according to the hyperplanes associated to the principal components of the data in each hyperquadrant. Each of the 2 hyper-quadrants is a data partition in a λ-dimension subspace, whose intrinsic dimensionality λ ≤ d is reduced from the root dimensionality d by the principal components analysis, which discards the ...

متن کامل

A Novel Fuzzy Based Method for Heart Rate Variability Prediction

Abstract In this paper, a novel technique based on fuzzy method is presented for chaotic nonlinear time series prediction. Fuzzy approach with the gradient learning algorithm and methods constitutes the main components of this method. This learning process in this method is similar to conventional gradient descent learning process, except that the input patterns and parameters are stored in mem...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2011